METHOD FOR PRE-SUPPRESSING NOISE OF IMAGE

Field of the Technology 

The present invention relates generally to the field of multimedia communication, and
more particularly to a method for pre-suppressing noise of an image at the transmitting end.

Background of the Invention 

In general, a video image contains a massive amount of data, so it is infeasible to
communicate or store a video image directly. Examples of communicating video
images are videoconferences, video telephony and remote teaching. Examples of
storing video images are multimedia databases, VCD and DVD. In a video
service system there are, besides video data, also audio data, T.120 data and control
information. At the same time, a video service system is required to be real-time
and interactive and to have good image quality. Therefore, the video image data
should be compressed with a high compression ratio. At present, there are international
standards for video image compression: H.261 and H.263 are standardized by ITU-T,
and MPEG1, MPEG2 and MPEG4 are standardized by ISO. These standards
incorporate important current compression techniques and have much in common:
for example, they all use the Common Intermediate Format (CIF), and they all use a
hybrid of motion-compensated prediction and the Discrete Cosine Transform (DCT).

In a video system implemented with the above standards, it is unavoidable that
an acquired video image contains some noise. For example, there is high-frequency
impulse noise, caused by large changes in the luminance and chrominance
components within a small region of an image, and random noise, generated
by A/D conversion and quantization during signal sampling. If the noise is not
suppressed before compression, the compression efficiency is greatly decreased.
Therefore, in order to obtain better compression efficiency and good image quality,
pre-processing and post-processing are needed to reduce or eliminate the noise in an
image.

There are many existing methods for suppressing noise. In general, each of
them uses a suitable filtering method in the spatial domain or the frequency domain.
The filtering methods can be divided into linear filtering, such as one-dimensional
finite impulse response (1-D FIR) filtering and two-dimensional finite impulse response
(2-D FIR) filtering, and non-linear filtering, such as median filtering and threshold
filtering.

US patent 5787203, titled "Method and system for filtering
compressed video images", discloses a method for filtering in the spatial
domain. A nonlinear filtering method is applied to the image differences after
motion-compensated prediction and before the DCT. Filtering is performed twice: first
a threshold filter reduces or eliminates random noise, then a cross-shaped median
filter reduces or eliminates high-frequency impulse noise. Together they reduce the
overall compression bitrate by 10% to 20%.

US patent 5325125, titled "Intra-frame filter for video
compression systems", discloses a method of linear filtering directly before
compression. The method applies a two-dimensional (2-D) filter to remove
high-frequency components and high-frequency impulse noise in the diagonal
direction of the image.

In order to reduce or eliminate noise effectively, the existing noise suppression
methods add a one-dimensional or two-dimensional filtering process to the original
processing procedure. In general, however, a filtering calculation takes time and
is a heavy load for a system. Hence a videoconference system, which has strict
real-time and interactivity requirements, needs a noise suppression method
that costs little time and preserves the original image as much as possible. Among the
existing filtering techniques, threshold filtering is a method with a small amount of
calculation. Its basic principle is as follows: at the transmitting end, the DCT
coefficients of a CIF image block are quantized in a preset sequence; when a DCT
coefficient is equal to or less than a predetermined threshold, the DCT coefficient is set
to zero; and when a DCT coefficient is greater than the predetermined threshold, the
DCT coefficient is unchanged. In essence, this method of pre-suppressing noise is a
constant threshold filtering method. The disadvantage of such pre-suppressing noise
methods is that the selection of the threshold involves a conflict. If a smaller threshold
is selected, higher image quality can be obtained, but too few DCT coefficients are set
to zero and the compression efficiency is unsatisfactory. If a larger threshold is selected,
the compression efficiency is satisfactory, but too many DCT coefficients are set to
zero and the image quality is unsatisfactory.
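
The constant threshold scheme described above can be sketched roughly as follows.
This is only an illustration: the function name constant_threshold_filter, the sample
coefficient values and the comparison of signed coefficients by magnitude are
assumptions that the text does not state.

```python
def constant_threshold_filter(coeffs, threshold):
    """Zero every coefficient that does not exceed a fixed threshold.

    Assumption: signed DCT coefficients are compared by magnitude.
    """
    return [0 if abs(c) <= threshold else c for c in coeffs]


# Hypothetical coefficients of one block, already in scan order.
block = [52, -9, 6, 3, -2, 4, 1, 0]
low = constant_threshold_filter(block, 2)   # few zeros, more content kept
high = constant_threshold_filter(block, 6)  # many zeros, more content lost
```

With the small threshold only three coefficients become zero, while the large
threshold leaves just the first two coefficients, which mirrors the conflict between
image quality and compression efficiency.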

Summary of the Invention 

The purpose of the invention is to provide a method that uses adaptive threshold
filtering to pre-suppress noise in an image. Compared with the constant threshold
filtering method, the method can further increase image compression efficiency
without decreasing the image quality.

The purpose of the invention is achieved as follows. A method for pre-suppressing
noise of an image comprises the following steps. At the transmitting end, the
DCT coefficients of a CIF image block are quantized in sequence. If the currently
processed DCT coefficient value is equal to or less than the threshold in use
for the current coefficient, the DCT coefficient is set to zero, and the threshold
is increased and used as the threshold for processing the next DCT
coefficient. If the increased threshold is greater than a predetermined upper
limit, the increased threshold is replaced with the predetermined
upper limit. If the currently processed DCT coefficient value is greater
than the threshold in use for the current coefficient, the threshold is
reset to a predetermined initial threshold, which is used as the
threshold for processing the next DCT coefficient.

The method for pre-suppressing noise of an image further includes the
following. At the transmitting end, a low-pass FIR filter whose cut-off frequency is
less than 0.5π is applied when a video image is transformed from the CCIR 601
format to CIF. Here CCIR stands for the Consultative Committee on International
Radio, and CCIR 601 is the 601 standard proposed by CCIR, which defines a digital
video standard for broadcasting corresponding to 525-line and 625-line television
systems. The luminance component and chrominance component are filtered by the
low-pass FIR filter in the vertical and horizontal directions, respectively.

Said low-pass FIR filter is a one-dimensional FIR filter.

The cut-off frequency of said low-pass FIR filter is between 0.25π and 0.4π.

After the luminance component and chrominance component have been
processed by a low-pass FIR filter with a cut-off frequency of less than 0.5π, the image
is further filtered by a two-dimensional median filter.

If the currently processed DCT coefficient value is equal to or less than the
threshold in use for the current coefficient, the DCT coefficient is set to zero. At
the same time, the threshold is increased by one and used as the
threshold for processing the next DCT coefficient.

Said DCT coefficients are processed in a sequence that starts
from the upper left corner of the DCT coefficient table and follows a zigzag pattern.
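
A zigzag sequence of this kind can be generated as in the sketch below. The text
only states the starting corner and the zigzag pattern; the direction of the first
step (to the right, as is conventional in H.26X and JPEG coders) and the function
name zigzag_order are assumptions.

```python
def zigzag_order(n=8):
    """Return (row, col) pairs of an n x n table in zigzag scan order."""
    order = []
    for s in range(2 * n - 1):  # s = row + col selects one anti-diagonal
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        # Even diagonals run from bottom-left to top-right, odd ones the reverse.
        if s % 2 == 0:
            rows = reversed(rows)
        order.extend((r, s - r) for r in rows)
    return order


order = zigzag_order()
```

The resulting list visits all 64 positions of an 8x8 table exactly once, starting at
(0, 0) and ending at (7, 7).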

Said initial threshold and said upper limit of the threshold can be predetermined such
that the initial threshold equals kQ and the upper limit of the threshold equals 1.5kQ,
wherein Q is the quantization level and k is a constant between zero and one determined
by the channel bandwidth.
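
As a small worked example of these two formulas, the sketch below computes both
bounds. The function name threshold_bounds and the sample values Q = 8 and
k = 0.5 are hypothetical and chosen only for illustration.

```python
def threshold_bounds(q, k):
    """Return (initial threshold, upper limit), i.e. (kQ, 1.5kQ)."""
    if not 0.0 <= k <= 1.0:
        raise ValueError("k is expected to lie between zero and one")
    initial = k * q
    return initial, 1.5 * initial


# Hypothetical quantization level and bandwidth-dependent constant.
initial, upper = threshold_bounds(q=8, k=0.5)
```

For these sample values the threshold starts at 4 and can grow to at most 6.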

The method for pre-suppressing noise in an image can be understood as a
filtering method with an adaptive threshold. Compared with the constant threshold
filtering method, noise is suppressed more efficiently without decreasing the
image quality, and the image compression efficiency is further increased. Of course,
when an image is filtered with a nonlinear threshold, part of the boundary information
of the image is lost along with the noise that is reduced or eliminated. Nevertheless,
the human eye is highly sensitive to low-frequency components of an image, whereas
its sensitivity to high-frequency components is greatly reduced. Therefore, within the
allowable range of human visual perception, the method can be used to obtain better
image quality. Another advantage of adaptive threshold filtering is its small amount
of calculation, which suits a video service system with strict real-time and
interactivity requirements.

Brief Description of the Drawings 

Fig. 1 shows the DCT coefficient distribution of 8x8 image blocks in the H.26X (i.e.
H.261, H.263, etc.) format.

Fig. 2 shows a framework of an embodiment of the pre-suppressing noise method
for an image according to the present invention.

Fig. 3 shows a processing flowchart of adaptive threshold filtering in the Fig. 2
embodiment.

Fig. 4 shows a framework of another embodiment of the pre-suppressing noise
method for an image according to the present invention.

Embodiments of the Invention 

The invention is described in detail hereinafter with reference to the drawings. It
should be noted that this description illustrates the invention and does not limit
it.

Fig. 2 shows a framework of an embodiment of the pre-suppressing noise method
for an image, i.e. the filtering positions in an H.26X video compression process. As shown
in Fig. 2, this embodiment has two filters: a one-dimensional FIR filter and an
adaptive threshold filter. The one-dimensional FIR filter is a linear filter for
high-frequency impulse noise, and the adaptive threshold filter is a nonlinear filter for
random noise. These two filters work in the spatial domain and the frequency domain,
respectively. In the following, the one-dimensional low-pass FIR filter and the adaptive
threshold filter are described in turn.

As shown in Fig. 2, while a video image is being transformed from the CCIR 601
format to the CIF format, a one-dimensional FIR filter processes the luminance component
and chrominance component in the horizontal and vertical directions, respectively, and the
high-frequency impulse noise in the image is suppressed. Naturally, the
corresponding high-frequency part of the image is also filtered out. Nevertheless, the
human eye is highly sensitive to low-frequency components of an image, whereas its
sensitivity to high-frequency components is greatly reduced. By applying a low-pass
filter with an appropriate cut-off frequency, better image quality can be obtained within
the allowable range of human visual perception.

To select the cut-off frequency of the one-dimensional FIR low-pass filter in this
embodiment, a spectrum analysis is performed by treating an image as a one-dimensional
signal. When the fitted curve of the image spectrum before H.26X compression is compared
with that of the image spectrum after reconstruction, the differences between the two
fitted curves are relatively large. After filtering with a filter of smaller cut-off
frequency (< 0.5π), the differences between the two fitted curves are smaller. From these
differences, a cut-off frequency range can be determined. In general, it is between
0.25π and 0.4π. Taking a seven-tap filter as an example, the low-pass FIR filter in this
embodiment can be a filter with a cut-off frequency of 0.4π and
h = { -2/256, 12/256, 66/256, 104/256, 66/256, 12/256, -2/256 }.
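
The behaviour of these coefficients can be checked with a short sketch such as the
one below. The function names fir_filter and gain are hypothetical, the handling of
the signal borders by repeating the edge sample is an assumption, and the
down-sampling step of the CCIR 601 to CIF conversion is left out.

```python
import cmath
import math

# Coefficients h quoted in the text.
H = [c / 256 for c in (-2, 12, 66, 104, 66, 12, -2)]


def fir_filter(samples, taps=H):
    """Filter a 1-D signal; border samples are repeated (an assumption)."""
    half = len(taps) // 2
    last = len(samples) - 1
    out = []
    for i in range(len(samples)):
        acc = 0.0
        for j, t in enumerate(taps):
            idx = min(max(i + j - half, 0), last)  # clamp index at the borders
            acc += t * samples[idx]
        out.append(acc)
    return out


def gain(omega, taps=H):
    """Magnitude response at normalized frequency omega (radians per sample)."""
    return abs(sum(t * cmath.exp(-1j * omega * n) for n, t in enumerate(taps)))


flat = fir_filter([100] * 10)        # a constant line of pixels
dc_gain = gain(0.0)
cutoff_gain = gain(0.4 * math.pi)
nyquist_gain = gain(math.pi)
```

The coefficients sum to one, so flat regions pass unchanged. The gain falls to about
one half at 0.4π and to zero at π, which is consistent with the stated cut-off
frequency.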

After filtering with the one-dimensional FIR filter and H.26X compression, the
compression bitrate of an image can be reduced by 10% to 20%. At the same
quantization level, there is no difference in image quality. In a practical H.26X
system, however, the general control strategy is that when the buffer fullness decreases,
the quantization level is decreased to raise the image quality.

Fig. 1 shows the DCT coefficient distribution of 8x8 image blocks in the H.26X
format. As shown in Fig. 1, after the DCT, the low-frequency components of
the 8x8 image block are distributed in the upper left part, and the high-frequency
components are distributed in the lower right part. The low-frequency components
correspond to the image detail, and the high-frequency components correspond to the
boundary information of the image and to noise, such as random noise and
high-frequency impulse noise. Since most of the high-frequency impulse noise has already
been removed by the preceding FIR filtering, the main noise to be filtered here is random noise.
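
The concentration of low-frequency content in the upper left part can be illustrated
with a direct, unoptimized 2-D DCT. In this sketch the function name dct_2d and the
sample block are hypothetical, and an orthonormal DCT-II is assumed as the
transform.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def scale(u):
        return math.sqrt(1.0 / n) if u == 0 else math.sqrt(2.0 / n)

    def basis(x, u):
        return math.cos((2 * x + 1) * u * math.pi / (2 * n))

    return [
        [
            scale(u) * scale(v) * sum(
                block[x][y] * basis(x, u) * basis(y, v)
                for x in range(n)
                for y in range(n)
            )
            for v in range(n)
        ]
        for u in range(n)
    ]


# Hypothetical smooth block: brightness rises gently from left to right.
block = [[100 + 2 * y for y in range(8)] for _ in range(8)]
coeffs = dct_2d(block)
```

For this smooth block only the first row of the coefficient table is non-zero, and
the largest value sits at the upper left corner.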

In this embodiment, the adaptive threshold filter scans the DCT coefficient table in
a zigzag pattern, and the 64 DCT coefficients are processed in sequence. The processing
flowchart is shown in detail in Fig. 3. In Fig. 3, *DCT_Coeff is a pointer to the DCT
coefficients. The threshold Thresh for a DCT coefficient is correlated with the
quantization level Q. Compared with a constant threshold, a varying threshold adapts
better to changes in image content and helps protect the important content of the
image. Thresh_Max is the upper limit of the threshold and is set to 1.5kQ.
The initial threshold is set to kQ, wherein k ranges from 0.0 to 1.0 and is selected
depending on the channel bandwidth. When the channel bandwidth is smaller, a larger
k value is selected; for example, when the channel bandwidth is less than
384 kbps, k is set to 1.0. When the channel bandwidth is larger, a smaller k is
selected; for example, when the channel bandwidth is 2 Mbps, k is set to 0.4. This
treatment helps obtain as many consecutive zeros as possible. If a DCT
coefficient is equal to or less than the threshold Thresh, the DCT coefficient is set to
zero and the threshold Thresh is increased by one. When the threshold Thresh becomes
greater than the upper limit Thresh_Max, the threshold Thresh is set to Thresh_Max. If a
DCT coefficient is greater than the threshold Thresh, the threshold Thresh is
reset to the initial threshold kQ. This treatment helps protect
important boundary information of an image.
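
The adaptive threshold procedure described above can be sketched as follows. This
is not the flowchart of Fig. 3 itself: the function name adaptive_threshold_filter,
the sample coefficients and the comparison of signed coefficients by magnitude are
assumptions made for illustration.

```python
def adaptive_threshold_filter(coeffs, initial, upper):
    """Filter coefficients given in scan order with an adaptive threshold.

    Assumption: signed DCT coefficients are compared by magnitude.
    """
    out = []
    thresh = initial
    for c in coeffs:
        if abs(c) <= thresh:
            out.append(0)
            # Raise the threshold by one, but never above the upper limit.
            thresh = min(thresh + 1, upper)
        else:
            out.append(c)
            # A significant coefficient resets the threshold.
            thresh = initial
    return out


# Hypothetical coefficients in zigzag order, with kQ = 4 and 1.5kQ = 6.
coeffs = [40, 3, -4, 5, 9, 2, 6, 1]
filtered = adaptive_threshold_filter(coeffs, initial=4, upper=6)
```

In this example the coefficient 5 is set to zero because two zeros precede it and
the threshold has grown to 6, whereas a constant threshold of 4 would keep it. The
later coefficient 6 survives because the threshold was reset by the 9 before it.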

Of course, when an image is filtered with a nonlinear threshold, noise is
reduced or eliminated, but at the same time part of the boundary information of the
image is lost. Nevertheless, the human eye is highly sensitive to low-frequency
components of an image, whereas its sensitivity to high-frequency components is
greatly reduced. Therefore, within the allowable range of human visual perception, the
method can be used to obtain better image quality.

The embodiment is a highly efficient filtering method for reducing or eliminating
video image noise. The method combines linear filtering with nonlinear filtering, and
spatial domain filtering with frequency domain filtering. Because the
method applies only one linear filter and one nonlinear filter, high-frequency impulse
noise and random noise in a video image can be reduced or eliminated with a small
amount of calculation. In the first filtering stage, a filter with a lower cut-off
frequency is applied during the format transformation, which adds no
calculation. In the second filtering stage, an adaptive threshold filter is applied
when the DCT coefficients are quantized. It can be seen from Fig. 3 that this adds
only a very small amount of calculation. By using the method of this embodiment to
reduce or eliminate noise, the compression bitrate is reduced by 10% to 30%, so
the video compression efficiency and image quality are improved.

Fig. 4 shows a framework of another embodiment of the pre-suppressing noise
method for an image according to the invention. This
embodiment is based on the previous embodiment and adds a 3x3 two-dimensional
median filter placed before the transformation of the image to the CIF format. Since
its amount of calculation is larger, the median filter is preferably implemented in
hardware or DSP software. The median filter is good at reducing or
eliminating impulse noise while preserving image boundary
information. Combined with the later adaptive threshold filtering, it yields a
better noise reduction or elimination effect for an image.
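
A 3x3 median filter of the kind used in this embodiment can be sketched as below.
The function name median_filter_3x3 and the sample image are hypothetical, and
repeating the nearest pixel at the image borders is an assumption, since the text
does not say how borders are handled.

```python
from statistics import median


def median_filter_3x3(image):
    """Replace each pixel of a 2-D list by the median of its 3x3 neighbourhood.

    Assumption: pixels outside the image are taken from the nearest border pixel.
    """
    rows, cols = len(image), len(image[0])
    out = []
    for r in range(rows):
        row = []
        for c in range(cols):
            window = [
                image[min(max(r + dr, 0), rows - 1)][min(max(c + dc, 0), cols - 1)]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
            ]
            row.append(median(window))
        out.append(row)
    return out


# Hypothetical flat region with a single impulse at position (1, 1).
image = [
    [10, 10, 10, 10],
    [10, 255, 10, 10],
    [10, 10, 10, 10],
    [10, 10, 10, 10],
]
cleaned = median_filter_3x3(image)
```

The isolated impulse is removed completely, while every other pixel keeps its value.
Each output pixel requires ordering nine values, which reflects the larger amount of
calculation noted above.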






PCT/CN02/00n0 



It will be appreciated that the present invention is in no way limited to the
embodiments mentioned above. The preferred embodiments are only used for a detailed
description of the invention. It should be apparent to those skilled in the art that the
invention can be modified or equivalently replaced without departing from the scope
and spirit of the invention. Accordingly, we claim as our invention all such
embodiments as come within the scope of the following claims and equivalents
thereto.